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ABSTRACT 

Over the past several years, structural studies 
have led to the unexpected discovery of iron- 
sulfur clusters in enzymes that are involved in 
DNA replication/repair and protein biosynthesis. Al- 
though these clusters are generally well-studied co- 
factors, their significance in the new contexts of- 
ten remains elusive. One fascinating example is a 
tryptophanyl-tRNA synthetase from the thermophilic 
bacterium Thermotoga maritima, TmTrpRS, that has 
recently been structurally characterized. It repre- 
sents an unprecedented connection among a primor- 
dial iron-sulfur cofactor, RNA and protein biosyn- 
thesis. Here, a possible role of the [Fe 4 S 4 ] cluster 
in tRNA anticodon-loop recognition is investigated 
by means of density functional theory and compari- 
son with the structure of a human tryptophanyl-tRNA 
synthetase /tRNA complex. It turns out that a cluster- 
coordinating cysteine residue, R224, and polar main 
chain atoms form a characteristic structural motif for 
recognizing a putative 5' cytosine or 5' 2-thiocytosine 
moiety in the anticodon loop of the tRNA molecule. 
This motif provides not only affinity but also speci- 
ficity by creating a structural and energetical penalty 
for the binding of other bases, such as uracil. 



INTRODUCTION 

Iron-sulfur clusters play a variety of important roles in 
the biochemistry of the cell. For a long time they have 
been known to facilitate electron transport, as for example 
in ferredoxin-type proteins (1,2), but also enzymatic con- 
version reactions, with the tricarbonic acid cycle-enzyme 
aconitase being a famous example (3-5). The significant 
progress of structural biology over the last two decades that 
culminated in structural genomics initiatives has shown that 
iron-sulfur clusters are far more widespread than previ- 
ously thought and appear at prominent positions in cru- 
cial cellular processes such as protein biosynthesis (6-8) 
and DNA replication and repair without, however, directly 



unveiling their functional role (9-11). It could be demon- 
strated for example that the [Fe4S4] cluster harbored by the 
MutY protein (Escherichia coli) facilitates electron transfer 
steps that occur during DNA damage recognition and re- 
pair (12-15). Frequently, however, the exact functions of the 
iron-sulfur clusters remain elusive. This is the case for the 
recently structurally resolved DNA Helicase XPD (10,11) 
(Xeroderma pigmentosum D in human cells or Ding in 
E. coli) and Trp-tRNA synthetase (TrpRS) from the ther- 
mophilic bacterium Thermotoga maritima (Tm TrpRS). Ac- 
cording to current knowledge, the emergence of iron-sulfur 
clusters appears to be connected exclusively with bacterial 
life, an idea that is supported by the fact that in eukaryotic 
cells, intact mitochondria are required for cluster biosyn- 
thesis (16,17). This observation, together with the seemingly 
self-sustained existence of chemosynthetic bacteria and ar- 
chaea at hydrothermal vents along the mid-oceanic ridges, 
provides support for an iron-sulfur-based origin of life as 
proposed by Wachtershauser (18). We might never be able 
to uncover fully the processes that took place billions of 
years ago and eventually resulted in the first life forms, 
but the occurrence of an iron-sulfur cluster in a tRNA 
synthetase nevertheless provides a fascinating and sugges- 
tive extant connection among an ancient cofactor, nucleic 
acid-based information decoding and protein biosynthesis. 
Whether the cluster in this particular enzyme is really a rem- 
iniscence of early life on this planet or just the result of a 
secondary adaptation process to the extreme environment, 
its host organism lives in, cannot be easily answered. In any 
case, the assignment of a functional role in the process of 
aminoacylation is a critical first step in addressing this ques- 
tion. 

The crystal structure of TmTrpRS, recently solved by 
Han et al. (7), consists of a catalytic domain with canonical 
Rossmann fold and a tRNA Trp anticodon-binding (TAB) 
domain that harbors an [Fe 4 S 4 ] cluster (see Figure 1). Struc- 
tural superpositioning with human TrpRS (i/sTrpRS) in a 
complex with bovine tRNA (ifr-tRNA) led the authors to 
speculate about a possible involvement of the cubane in an- 
ticodon recognition. Here, the properties of the peculiar Fe- 
S cluster are studied by means of density functional theory 
(DFT), and the interaction with its direct protein environ- 
ment is analyzed. Due to the lack of structural informa- 
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Figure 1. Tryptophanyl-tRNA synthetase from Thermotoga maritime! 
(TmTrpRS). Left panel: Overall fold of one monomer of the homod- 
imeric enzyme, showing the embedding of the [Fe4S4] cluster in the tRNA 
anticodon-binding (TAB) domain. Right panel: Detailed view of the Fe-S 
cubane with the neighboring residues R224 and H242. The models shown 
were generated from the crystal structure by Han et a!. (7) (PDB entry 
2G36). Element color code — gray: C, blue: N, yellow: S, brown: Fe. 

tion about how 7mTrpRS binds to its cognate tRNA Trp , the 
//sTrpRS/ifr-trRNA complex is used to derive a minimal 
model for studying the recognition of a plausible 5' base in 
the anticodon by the TAB domain of TmTrpRS. It turns out 
that a characteristic structural motif formed by the cubane 
and side and main chain atoms of the enzyme could provide 
a specific interaction site suitable for tRNA base recogni- 
tion. 

MATERIALS AND METHODS 

All-electron calculations were carried out with the DFT 
programs provided by the TURBOMOLE suite (19) on model 
structures of the tryptophanyl-tRNA synthetases from 
Thermotoga maritima (PDB entry 2G36 (7)), Aeropyrum 
pernix (PDB entry 3A05 (8)) and Homo sapiens (PDB en- 
try 2DR2 (20)). Amino acids were included by saturating 
their carboxy-termini with NH-CH3 groups and their N- 
termini with acetyl groups. The coordinates for the cap- 
ping groups were taken from the preceding or following 
residues in the amino acid sequence. During structural op- 
timizations the C, O and N atoms of the main chain were 
kept fixed. All other atoms, including C a atoms, were al- 
lowed to relax freely. H atoms were added with the Mol- 
Probity server (21). As starting geometries for the bases, 
cytosine, 2-thiocytosine and uracil the coordinates of 5' 
cytosine in the anticodon of fir-tRNA Trp (complex struc- 
ture 2DR2 (20)) were chosen. For 2-thiocytosine and uracil, 
atoms were edited accordingly. Structural alignments were 
generated with Pymol (22). In the case of the complex struc- 
tures of TmTrpRS and ApTrpRS, steric clashes between 
the enzyme models and the ligands were resolved by force 
field pre-optimization, using the force field MMFF94 (23) 
as implemented in Avogadro (24). For quantum mechan- 
ical structural optimizations, DFT was applied with the 
TPSS (Tao, Perdew, Staroverov, and Scuseria) exchange- 
correlation functional (25). A triple-^ def2-TZVP basis set 

(26) , referred to as TZVP throughout the paper, was chosen 
and, optionally, empirical dispersion corrections (DFT-D3 

(27) , labeled by TPSS-D3 in the manuscript) were consid- 
ered as provided by the Turbomole package (19). As shown 




Figure 2. Molecular model of the TAB domain of TmTrpRS considered in 
this study. Left panel: Model extracted from the TmTrpRS crystal structure 
(PDB entry 2G36). Cysteine residues C236, C259, C266 and C269 were in- 
cluded together with their main chain atoms, as well as R224 and H242, 
and the main chain atoms between the C a atoms of R227 and G23 1 . Only 
polar hydrogen atoms are depicted. Hydrogen atoms/protons were added 
by using the MolProbity Server (21). Right panel: Overlay of the starting 
model (black) shown in the left panel and the structure optimized with 
TPSS/TZVP for the 2 [Fe 4 S 4 ] 1+ state of the cubane cluster (colored). El- 
ement color code — white: H, red: O, gray: C, blue: N, yellow: S, brown: 
Fe. 

previously, the combination of the TPSS functional with a 
TZVP basis is well suited to describe the active centers of 
iron-sulfur proteins (28-30). To accelerate calculations, ad- 
vantage was taken of the resolution-of-the -identity approx- 
imation (RI) with the corresponding auxiliary basis sets 
(31). In order to account for screening effects of the protein 
matrix and the solvent, the polarizable continuum solvent 
model, COSMO (32), was considered, with a dielectric con- 
stant of e = 4. Homology models of TrpRS enzymes were 
generated with the SWISS-MODEL web service (33-38). 
Electron density maps (not shown in this paper) were calcu- 
lated for the crystal structures 2G36, 3A05 and 3JXE with 
the crystallography package Phenix (39) using the published 
structure factors. Images of molecular structures were cre- 
ated with Pymol (22). 

RESULTS AND DISCUSSION 

Minimal model and cluster properties 

In the crystal structure of TmTrpRS by Han et al. (7), 
the Fe-S cluster located in the putative anticodon-binding 
(TAB) domain appears as a regular [Fe4S/i] cubane (see Fig- 
ure 1). For the theoretical analysis of its properties pursued 
here, a model was chosen that comprises the cluster with the 
four coordinating cysteine residues and all side chain and 
main chain atoms that are in close contact with the cubane 
moiety and that could play an additional role in base recog- 
nition (see below). The TniTrpRS model, which is depicted 
in Figure 2 (left panel), is large enough to investigate all rele- 
vant questions addressed in this work while still being com- 
putationally tractable with DFT. 

As of now, there have been no experimental data available 
that would help to assign the redox and magnetic states of 
the cluster. For this reason, a paramagnetic 2 [Fe4S4] 1+ and a 
diamagnetic '[Fe4S4] 2+ state, which are commonly encoun- 
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Figure 3. 7/nTrpRS TAB domain surface and potential tRNA interaction. 
Left panel: Solvent-excluded surface of the TAB domain of TmTrpRS near 
the [Fe4S4] cluster that shows that the S atom of C266 is directly solvent- 
exposed and forms a potential binding groove together with the residue 
R224. Right panel: Superposition of TmTrpRS (orange) with the structure 
(gray) of the complex of human TrpRS (Z/sTrpRS) and bovine tRNA (Bt- 
tRNA Trp ). 

tered in other proteins harboring Fe-S cubanes, were con- 
sidered. For an approximate description of the diamagnetic 
state, broken-symmetry (BS) DFT was applied (40-43). 

As can be seen from the crystal structure, the cubane is 
deeply embedded in the TAB domain, with only the sul- 
fur atom of C266 being directly solvent-accessible (Figure 
3, left panel). In addition to the coordinating residues C236, 
C259, C266 and C269, the cluster is in direct contact with 
R224, which presumably forms an H-bond with the sulfur 
of the cubane facet bordered by C259 and C266 that faces 
away from the protein surface. As the solvent-excluded sur- 
face shows, the guanidinium group of R224 is partially ex- 
posed to the solvent (see Figure 3, left panel). The second 
sulfur atom of the aforementioned cubane facet is within 
H-bonding distance of the main chain NH group of D221 
(see Figure 2, left panel). H242, being electrically neutral 
according to the prediction by the program package Mol- 
Probity (21), forms an H-bond with the sulfur atom of C269 
(see Figure 2, left panel). The main chain atoms from R227 
to G231 of Tm TrpRS were included in the model because 
they provide a framework to hold R224 in place. Indeed, 
structural optimization with TPSS/TZVP resulted in only 
small deviations with respect to the crystallographically de- 
termined atom positions (see Figure 2, right panel). This 
holds true for both oxidation states considered. 

In order to analyze the electronic structure of the opti- 
mized models, the number of unpaired electrons on selected 
cluster atoms, N a , , — TVp, ,, was calculated as well as the 
partial charges obtained by Mulliken population analysis 
(see Table 1). Additionally, optimizations of just the cluster 
with its coordinating cysteine residues were performed to 
determine whether both properties are affected by the pres- 
ence of the amino acids in direct contact with the cubane, 
R224 and H244. As can be seen from Table 1 , the spin po- 
larization of the iron atoms of the Fe-S cluster is almost un- 
changed when the isolated cubane and the Tm TrpRS model 
are compared. A reduced spin polarization on Fe4 in the re- 
duced oxidation state can also be found in both cases and is 
therefore not caused by the closeness of R224 or H244, and 
rather due to the fact that the number of electrons is uneven 
for the 2 [Fe4S4] 1+ state as opposed to the anti-ferromagnetic 
singlet, 1 [Fe 4 S 4 ] 2+ . On the S atoms of C266 and C269, only 




Figure 4. Interaction between tRNA and TrpRS. Left panel: Detailed view 
of the complex structure HsTrpRS / BMRNA Trp of Shen et al. (20) (PDB 
entry 2DR2) that shows the interaction of the anticodon bases with key 
residues and main chain atoms of the synthetase. The three bases of the 
anticodon, 5'-CCA-3', are explicitly labeled (Cyt, Cyt, Ade). Right panel: 
Detailed view of the hybrid structure of TmTrpRS and 5/-tRNA that was 
obtained by superposition of TinTrpRS (PDB entry 2G36) and ZMrpRS 
(PDB entry 2DR2). The 5' cytosine (Cyt) is located between the Fe-S 
cubane and R224. Because the current study focuses on this cytosine, the 
other two bases are not labeled explicitly for the sake of clarity. Element 
color code — red: O, gray/orange: C, blue: N, yellow: S, brown: Fe. 

small residual spin polarization could be detected. Whereas 
the residues R224 and H244 do not affect spin polarization, 
they do apparently somewhat alter the charge distribution 
in the cluster. As a result, the Fe atoms Fe 3 and Fe 4 , belong- 
ing to the facet that is almost collinear with the axis formed 
by the imidazole and the guanidinium group of H244 and 
R224, are more positively charged, and the iron atoms Fej 
and Fe2, of the opposite facet, are more negatively charged 
than the corresponding atoms of the free cubane (see Table 
1). Interestingly, the charges on the two S atoms considered 
are almost unchanged by the presence of R224 and H244. 



Comparison of ZfsTrpRS and TmTrpRS TAB domains 

Currently, only one crystal structure of a TrpRS/tRNA 
complex has been solved (at 3.1 A resolution) and con- 
sists of human TfrTrpRS and bovine 5/-tRNA Tr P (20). Al- 
though TmTrpRS and //.sTrpRS share only 13.4 % identical 
residues, both proteins feature a high degree of structural 
conservation, especially in the anticodon-binding (TAB) 
domain (see Figure 3, right panel, 3.6 A r.m.s.d. overall). 
This similarity allowed Han et al. to speculate about pos- 
sible protein/tRNA interactions for the bacterial enzyme 
for which no complex structure is available to date (7). The 
location of the [Fe 4 S 4 ] cluster in the center of the TAB do- 
main could indeed allow for direct interaction with the anti- 
codon loop of a tRNA molecule and suggests a possible role 
in anticodon recognition. How tRNA proofreading can in 
general be accomplished by aminoacyl-tRNA synthetases 
is exemplified by the //.sTrpRS/.B?-tRNA Trp complex struc- 
ture (20). TTsTrpRS recognizes tRNA Trp by its discriminator 
adenine in the tRNA acceptor stem as well as by specific in- 
teractions with tRNA base moieties in the anticodon loop 
(20) (see Figure 3, right panel). 

Figure 4 shows the interplay among the three bases of the 
anticodon-loop, 5'-CCA-3', of 5/-Trp-tRNA and /faTrpRS 
as present in the crystal structure of the complex. The ade- 
nine is in H-bonding contact with D382 and possibly has 
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Table 1. Number of unpaired electrons (N a- , — A r p i ,) on atom i, derived from the difference of alpha and beta spin densities (p tt] i — pp, and Mulliken 
partial charges for the Fe atoms of the [Fe4S4] cluster and S atoms of C266 (Si) and C269 (S2) of 7)j;TrpRS 



Fei Fe2 Fe3 Fe4 Si S2 



[Fe 4 S 4 ] 1+ 


3.2 


-3.0 


3.2 


-2.7 


-0.1 


0.1 


[Fe 4 S 4 ] 2+ 


3.1 


3.1 


-3.1 


-3.1 


-0.1 


-0.2 


7mTrpRS-[Fe 4 S 4 ] 1+ 


3.1 


-3.0 


3.2 


-2.7 


-0.1 


0.1 


Lm lipKb— [re 4 S 4 J 


3.1 


3.0 


—3.0 


—3.0 


— 0.1 


—0.2 


r;«TrpRS(H244A)-[Fe 4 S 4 ] 1 + 


3.1 


-3.0 


3.2 


-2.7 


-0.1 


0.1 


r«7TrpRS(H244A)-[Fe 4 S 4 ] 2+ 


3.1 


3.0 


-3.1 


-3.0 


-0.1 


-0.2 










Partial charges 








F ei 


Fe2 


Fe 3 


Fe 4 


Si 


s 2 


[Fe 4 S 4 ] 1+ 


0.43 


0.39 


0.43 


0.38 


-0.58 


-0.58 


[Fe 4 S 4 ] 2 + 


0.40 


0.37 


0.40 


0.38 


-0.49 


-0.45 


TmTrpRS-[Fe 4 S 4 ] 1+ 


0.42 


0.36 


0.45 


0.42 


-0.56 


-0.57 


TmTrpRS-[Fe 4 S 4 ] 2+ 


0.38 


0.34 


0.41 


0.43 


-0.47 


-0.46 


7mTrpRS(H244A)-[Fe 4 S 4 ] 1 + 


0.42 


0.36 


0.41 


0.42 


-0.56 


-0.55 


TmTrpRS(H244A)-[Fe 4 S 4 ] 2+ 


0.37 


0.35 


0.38 


0.43 


-0.48 


-0.42 



[Fe 4 S 4 ] refers to just the cubane with coordinating cysteine residues, ThiTrpRS assigns the T)j;TrpRS model studied here and ZmTrpRS(H244A) refers to 
the same model where H244 was replaced by alanine. The structures were optimized with TPSS-D3/TZVR Without explicit dispersion interactions the 
numbers are almost unchanged (see Supplementary Table S 1 ), except for the H244A mutant where no BS solution could be converged. For each compound, 
different BS solutions were tested and the one corresponding to the lowest energy is given. The energetical deviation, however, never exceeded 2 kcal/mol. 



TT-interactions with F377; the cytosine following in 5' di- 
rection forms H-bonds with the side chains of S378, K431, 
T427, and with the main chain oxygen atoms of G380 and 
R38 1 . The 5' cytosine is in H-bonding contact with the main 
chain NH-groups of T427 and G428, and its NH 2 group 
seemingly interacts with H387 (see Figure 4, left panel). 

In order to identify possible interactions that would facil- 
itate tRNA recognition in TmTrpRS, the latter was super- 
imposed with the complex structure of the human tRNA- 
synthetase and bovine tRNA Trp using the structural align- 
ment algorithm as implemented in Pymol (final r.m.s.d. 
3.6 A) (22). The superposition of TwTrpRS and .ffcTrpRS 
shows that in the given setting, a direct interaction between 
the tRNA anticodon loop and the [Fe4S4] cluster would be 
possible, with the 5' cytosine pointing directly to it (see Fig- 
ure 4, right panel). Apparently, this interaction could be 
supported by R224, which interacts with the Fe-S cluster, 
as discussed above (see Figure 4, right panel). Remarkably, 
the structural motif formed by the cubane moiety and the 
guanidinium group of R224 somewhat resembles the situa- 
tion in /faTrpRS, where H387 is most probably involved in 
anticodon recognition (see Figure 4, left panel). The super- 
position locates R224 at almost the same position as H387 
in iftTrpRS. Of course, structural alignment of the complex 
structure and 7»jTrpRS cannot provide accurate atom co- 
ordinates, and due to sequence variations and deviations of 
main chain coordinates, clashes occur between the bovine 
tRNA molecule and the bacterial enzyme. 

Base recognition and binding affinity 

The superposition of TmTrpRS and TfrTrpRS predicts that 
the [Fe4S 4 ] cluster predominantly interacts with the 5' cy- 
tosine of the anticodon. Therefore, the quantum mechani- 
cal study of tRNA anticodon recognition by TfriTrpRS pre- 
sented here focuses on the binding of the 5' base in or- 
der to render the computational problem tractable. One 



unknown in this context is created by the fact that tRNA 
base modifications have not yet been characterized in Ther- 
motoga maritima. Hence, the precise modes of interac- 
tion between TmTrpRS and its cognate tRNA might dif- 
fer from those studied here. However, unmodified cytosine 
as it appears in the anticodon loop of bovine tRNA Trp is 
clearly the structurally simplest option and shall serve as 
an example to study how the peculiar Fe-S cluster could 
be involved in base recognition. Along with cytosine, 2- 
thiocytosine is considered as a known base modification 
in tRNA molecules in order to account for variations in 
the anticodon. Deciphering the correct 3' base in a mRNA 
codon (corresponding to the 5' base in the anticodon of 
the tRNA) is most important in the case of tryptophan as 
it is the only amino acid besides methionine that is rep- 
resented by a single triplet in the genetic code (5'-UGG- 
3'); a wobble base in this position would therefore interfere 
with translational fidelity. As a consequence, tryptophanyl 
tRNA-synthetases have to recognize the correct 5' base of 
the anticodon with great accuracy. 

Cytosine and 2-thiocytosine, which are considered in this 
study, are represented by the base moieties together with 
their associated sugar residues. In order to avoid unrealistic 
rearrangements during structural optimization due to iso- 
lated charges and polar groups that would be compensated 
in the native enzyme, the hydroxy and bridging phosphate 
groups of the sugar molecules were removed. Only the 2' O- 
group was included and neutralized by a proton (see Figure 
5). Note that, for the sake of simplicity, the 5' base models 
considered here are referred to by the names of their nu- 
cleobases (cytosine, 2-thiocytosine, uracil) consistently and 
not by the naming convention for the corresponding nucle- 
osides (cytidine, 2-thiocytidine, uridine). This prevents fre- 
quent switching between nomenclatures. 

The strength and specificity of the interaction between 
the 5' base of the anticodon loop and the TwTrpRS model 
was determined by calculating the interaction energy by 
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T427 G428 



H387 




Figure 5. Interaction of i/.?TrpRS and 7mTrpRS models with 5' cyto- 
sine. Left panel: TPSS/TZVP optimized structure of the //.sTrpRS model 
and cytosine. Dispersion interactions were not considered explicitly. Right 
panel: TPSS/TZVP optimized structure of the TmTrpRS model and cy- 
tosine. The oxidation state of the cubane was [Fe4S4] 1+ . Dispersion in- 
teractions were not considered explicitly. H244 is not shown explicitly for 
the sake of better visibility of cytosine binding. Distances are given in 
angstroms. Element color code — white: H, red: O, gray/orange: C, blue: 
N, yellow: S, brown: Fe. 



means of DFT. In order to judge whether the energies 
obtained are indicative of a significant interaction, analo- 
gous calculations were performed for the human //sTrpRS 
model. DFT typically underestimates dispersion interac- 
tions and has therefore been complemented (ad hoc) by 
empirical correction terms to alleviate this shortcoming 
(44,45). Therefore, results are given with and without the 
explicit consideration of dispersion interactions throughout 
this paper. 

At first, in order to resolve structural clashes between the 
ligand atoms (cytosine/2-thiocytosine) and the TwTrpRS 
model that were caused by the simplistic structural- 
alignment approach, a constrained molecular mechanics 
optimization was performed, where only the bases and the 
attached sugar moieties were allowed to relax structurally. 
The pre-optimized coordinates were then used for full quan- 
tum mechanical optimizations. 

The optimization of the iMrpRS/cytosine model cor- 
roborates and refines the information provided by the X- 
ray structure published by Shen et al. and shows that the 
5' cytosine is held in place by interactions with main chain 
atoms and H387 (see Figure 5, left panel). Specifically, the 
carbonyl group of cytosine interacts with the peptide-NH of 
G428. The imine-N of the pyrimidine ring is in H-bonding 
contact with the peptide-NH of T427 and the amino group 
of cytosine interacts with the peptide carbonyl group of 
R381 and with the imidazole side chain of H387. Evalu- 
ating the energetic contribution to tRNA recognition and 
binding reveals a rather weak interaction of —7 kcal/mol if 
dispersion interactions are not explicitly considered in the 
DFT approach (see Table 2). Dispersion interactions may 
play a role in non-specifically stabilizing the base in its posi- 
tion, for their consideration leads to a much stronger bind- 
ing energy of —32 kcal/mol. 

As the results in Table 2 show, cytosine would also bind to 
the bacterial enzyme 7w7TrpRS, and with —8 kcal/mol even 
slightly more exothermic than to iftTrpRS. Apparently, all 
polar groups of the cytosine molecule are in contact with 
atoms of the bacterial enzyme (see Figure 5, right panel). 
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Figure 6. Strength and specificity of JmTrpRS/anticodon base interac- 
tions. All structures were optimized with TPSS/TZVP or TPSS-D3 /TZVP. 
To demonstrate structural changes, the complex TmTrpRS/cytosine in 
oxidation state [Fe4S4] 1+ and optimized with TPSS/TZVP is always de- 
picted in black. Upper left panel: TmTrpRS/cytosine in oxidation state 
[Fe4S4] 2+ , Cyt: cytosine. Upper right panel: DnTrpRS/cytosine in oxi- 
dation state [Fe4S4] 1+ calculated with the explicit consideration of dis- 
persion interactions (TZVP-D3/TZVP), Cyt: cytosine. Lower left panel: 
JmTrpRS/2-thiocytosine in oxidation state [Fe4S4] 1+ , Cyt: cytosine, Thio- 
cyt: 2-thiocytosine. Lower right panel: TmTrpRS/uracil in oxidation state 
[Fe4S4] 1+ , Cyt: cytosine, Ura: uracil. Element color code — white: H, red: 
O, gray/orange: C, blue: N, yellow: S, brown: Fe. 
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Figure 7. Distribution of the Fe-S cluster sequence motif in TrpRS pro- 
teins among representatives of the bacterial and archaeal classes. The mul- 
tiple sequence alignment was generated with Clustal Omega (52,53). The 
organisms Thermotoga maritima and Aeropyrutn pernix, the TrpRS of 
which were studied quantum mechanically here, are highlighted in bold 
face. The cysteine residues of the consensus sequence 5'-C-X6~C-X2-C-3' 
are highlighted and correspond to C259, C266 and C269 in 7>nTrpRS. 



The carbonyl group of cytosine forms an H-bond with the 
peptide-NH of V267. The imine-N of the pyrimidine ring 
interacts with the peptide-NH of C266, whereas the cluster- 
coordinating S atom of this residue is in H-bonding contact 
with the NH2 group of the cytosine moiety. The N atom of 
the amino group apparently also interacts with the guani- 
dinium group of R224 which, in turn, is held in position due 
to its interaction with the cubane cluster and surrounding 
peptide groups (Figure 5). 
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Table 2. Energies for the binding of cytosine, 2-thiocytosine and uracil models to TrpRS models from Homo sapiens (//.sTrpRS), Thermotoga maritime! 
(TmTrpRS) and Aeropyrum pernix (ApTrpRS) 



XrpRS model 


Oxidation stflte 


Ligand 




Interaction Energy 










Dispersion Corrections 
+ 


STrpRS 




cytosine 


-6.5 


-32.3 


FrnTrpRS 


[Fe 4 S 4 ] 1+ / 2 + 


cytosine 


-7.8/-7.9 


-20.6/-20.6 


7)«TrpRS(H244A) 


[Fe 4 S 4 ]'+/ 2 + 


cytosine 


-7.9/-8.0 


-20.4/-20.0 


TmTrpRS 


[Fe 4 S 4 ] 1+ / 2 + 


2-thiocytosine 


-9.5/-8.9 


-21.7/-20.8 


FrnTrpRS 


[Fe 4 S 4 ]'+/ 2 + 


uracil 


+2.0/+5.1 


-5.6/-5.7 


ApTrpRS 


[Fe 4 S 4 ]'+/ 2 + 


cytosine 


-5.1/-4.7 


—18.6/— 16.7 



Energies are given in kcal/mol. Structural optimizations were performed with TPSS/TZVR with and without explicit dispersion corrections (TPSS-D3). 
For the archaeal and bacterial systems, ligand coordinates were derived by structural superposition with the i/.?TrpRS/5?-tRNA Tlp complex (PDB en- 
try 2DR2). Due to sterical clashes, the ligand molecules were subsequently relaxed by force field optimization prior to quantum mechanical structural 
optimizations. For DnTrpRS and yl/)TrpRS the first number refers to the [Fe 4 S 4 ] 1+ and the second one to the [Fe 4 S 4 ] 2+ oxidation state. 



Interestingly, the redox state of the Fe-S cluster has al- 
most no effect on binding strength. The interaction energies 
are virtually identical for both forms considered, [Fe4S4] 1+ 
and [Fe4S4] 2+ (see Table 2). Structurally, a change in the re- 
dox state leads only to small deviations of the cluster atoms 
and to only a slight movement of the cytosine ligand (see 
Figure 6, upper left panel). As in the case of //sTrpRS, 
dispersion interactions contribute significantly, but appar- 
ently non-specifically, to the interaction, because their ex- 
plicit inclusion results only in minor changes of the lig- 
and and TmTrpRS model coordinates (see Figure 6, up- 
per right panel). Replacing cytosine by 2-thiocytosine does 
not change this picture and the structural characteristics of 
the interaction remain the same. With approximately —9 
kcal/mol, the interaction with the enzyme model is even 
slightly stronger than in the the case of cytosine (see Ta- 
ble 2). The cytosine/2-thiocytosine exchange does not al- 
ter the binding mode of the base moiety with respect to the 
TwTrpRS model and therefore the binding partners for cy- 
tosine and 2-thiocytosine are the same (see Figure 6, lower 
left panel). Analogous results were obtained with explicit 
dispersion corrections (see Supplementary Figure SI). Due 
to the bigger van der Waals radius of sulfur as compared 
to oxygen, the distance between the thiocarbonyl group of 
2-thiocytosine and the NH-group of V267 is increased com- 
pared to the situation with cytosine. This results in a slight 
shift of the 2-thiocytosine molecule (see Figure 6, lower left 
panel). With about —10 kcal/mol, the background gener- 
ated by dispersion interactions remains almost unchanged 
(see Table 2). 

Taken together, these data strongly suggest that the struc- 
tural motif given by the Fe-S cluster and its surroundings 
provides a specific pattern of interaction partners for recog- 
nizing a cytosine or 2-thiocytosine 5' base in the anticodon 
loop of a tRNA Trp molecule. 

Specificity of base recognition 

As shown in the previous section, cytosine or 2-thiocytosine 
show clear affinities to TmTrpRS that are comparable to 
cytosine binding in the &TrpRS/5/-Trp-tRNA complex. 
This observation, however, directly raises the question of 
specificity, i.e. is the structural motif given by the [Fe4S 4 ] 
cluster and its embedding suited to discriminate between 



cognate and non-cognate bases? Given the structural char- 
acteristics of the bacterial enzyme, the binding of a purine 
base at the same position as cytosine or 2-thiocytosine 
seems sterically too demanding. Therefore, in an indepen- 
dent set of calculations, uracil was studied in order to de- 
termine whether or not its binding would be energetically 
sufficiently disfavored to allow for the specific recognition 
of cytosine/2-thiocytosine. 

The results in Table 2 demonstrate that uracil shows no 
affinity for the Tm TrpRS model if no dispersion interac- 
tions are considered. For both oxidation states, the bind- 
ing energetics is endothermic. As in the case of cytosine/2- 
thiocytosine, dispersion interactions contribute exothermi- 
cally on the order of —10 kcal/mol. For the given model 
sizes, i.e. the number of interacting atoms, this amount 
apparently represents the non-specific interaction between 
the base moiety and the enzyme. The structurally opti- 
mized coordinates that are depicted in Figure 6 (lower right 
panel) show that fewer favorable and specific interactions 
between uracil and the bacterial synthetase (in comparison 
to cytosine/2-thiocytosine) could be identified and the op- 
timization resulted in a reorientation of uracil with respect 
to cytosine due to sterical clashes, which explains the clear 
energetic penalty for the binding of uracil over cytosine/2- 
thiocytosine to TmTrpRS. With explicit dispersion correc- 
tions, a similar displacement of the uracil ligand could be 
observed (see Supplementary Figure SI). 

These data show that the putative binding motif formed 
by the Fe-S cubane and its direct protein environment could 
indeed serve as a pattern for the selective recognition of 
cytosine/2-thiocytosine in the 5' position of the tRNA an- 
ticodon loop, as it provides both affinity and specificity of 
binding. 

Significance and structural conservation of the R-FeS-H mo- 
tif 

Given that the Fe-S cluster in the TAB domain of ThiTrpRS 
with its characteristic embedding provides a specific and 
selective binding platform for cytosine/2-thiocytosine and 
could therefore be important for tRNA recognition and de- 
coding, as demonstrated above, the next question becomes 
whether the complete structural unit, that could be char- 
acterized as an arginine/Fe-S cluster/histidine (R-FeS-H) 
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Figure 8. Conservation of residues in the assumed cluster environments 
of the TrpRS proteins shown in Figure 8. The multiple sequence align- 
ment was generated with Clustal Omega (52,53). The organisms Thermo- 
toga maritime! and Aeropyrum pernix, the TrpRS of which were studied 
quantum mechanically here, are highlighted in bold face. The sequence 
positions explicitly considered in TmTrpRS and ^4/;TrpRS are highlighted 
and correspond to R224 in TmTrpRS, Q293 in ^TrpRS and H244 in 
DnTrpRS. 
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Figure 9. Potential binding of cytosine to 4/;TrpRS. Left panel: Superpo- 
sition of the starting model (black) extracted from the crystal structure of 
/4/;TrpRS (PDB entry 3A04) with the TPSS/TZVP structure (color). Right 
panel; TPSS/TZVP structure of the 4/jTrpRS/cytosine complex. For bet- 
ter visibility of cytosine binding, D309 and the crystal water are not shown. 
Distances are given in angstroms. Element color code — white: H, red: O, 
gray/orange: C, blue: N, yellow: S, brown: Fe. 



motif, is conserved and thus strictly required for the as- 
sumed function. 

Phylogenetic analysis done by taking TrpRS genes of rep- 
resentatives from all bacterial and archaeal classes into ac- 
count shows that both R224 and H242 are conserved in bac- 
teria which presumably contain an Fe-S cluster in their Tr- 
pRS enzymes, according to the occurrence of the cysteine- 
rich consensus sequence, 5'-C-X 6 -C-X 2 -C-3' (see Figure 7). 
In order to determine whether or not H242 is required for 
base recognition, a TfriTrpRS model was generated that car- 
ries an H242A mutation and for which cytosine binding en- 
ergies were determined by DFT calculations. Interestingly, 
despite being in H-bond contact with the S atom of C269, 
the substitution of H242 does not change the partial charges 
or the spin polarization on the cluster atoms significantly 
(see Table 1), in both oxidation states considered. Only the 



partial charge on the S atom of C269 becomes more posi- 
tive due to the missing H-bond otherwise formed with the 
imidazole side chain. This seems to be compensated for by a 
decrease of positive charge on the iron atom Fe3 of the clus- 
ter (compare Figure 2). Moreover, binding energy and bind- 
ing mode for cytosine are almost unchanged in the H242A 
mutant model (see Table 2 and Supplementary Figure SI). 

Therefore, the remarkably strong conservation of H242 
in the bacterial TrpRS proteins phylogenetically considered 
here seems not to be directly related to enzyme/base interac- 
tion and could be due to its involvement in other processes 
such as cluster bio-synthesis and assembly/insertion. 

Phylogenetic analysis shows that H242 is not conserved in 
archaeal representatives of the TrpRS enzyme class (see Fig- 
ure 8). Notably, only one crystal structure of an archaeal Tr- 
pRS carrying a cubane cluster — from the organism Aeropy- 
rum pernix Kl — has been published to date (/!/>TrpRS) (8) 
(PDB entry 3A04). The TAB domain region with its Fe-S 
cluster is shown in Figure 9 (left panel). The cluster is again 
coordinated by an amino acid that is surface-exposed, as 
in 7wTrpRS, but it is a glutamine residue (Q293) instead 
of an arginine. Similarly, H242 that is conserved in bacte- 
rial species is missing in ^4/TrpRS and replaced by an as- 
partate, D309. Due to this exchange, the carboxyl group of 
D309 is not in direct contact with the cluster. According to 
the refinement of Tsuchiya et al. (8), a crystal water is lo- 
cated between D309 and the S atom of C337 (see Figure 
9, left panel). Because of this apparent hydrogen bonding 
network, D309 and the water molecule were included in a 
minimal model of ApTrpRS (Figure 9, left panel). In order 
to study the binding of cytosine, Q293 (which according to 
structural alignment adopts the place of R224 in TmTrpRS) 
was considered for model construction. Furthermore, main 
chain atoms of the direct protein environment of the cubane 
and Q293 were included. 

The binding energetics documented in Table 2 show that 
cytosine also binds to the model of the archaeal enzyme 
^4/TrpRS, albeit with a somewhat lower affinity (around —5 
kcal/mol). The binding mode of the cytosine moiety is com- 
parable to that which was observed in TmTrpRS, with hy- 
drogen bonds between the NH 2 group of cytosine and the 
S atom of a cluster-coordinating cysteine (C337) (see Fig- 
ure 9, right panel). The same NH 2 group also interacts with 
the carbonyl-0 atom of the amide group of Q293. The 3'- 
N of the pyrimidine ring forms a hydrogen acceptor for the 
peptide NH group of C337; and the pyrimidine 's carbonyl- 
O interacts with the peptide NH group of G335. The in- 
teraction between a 5' cytosine and ^TrpRS calculated 
here is corroborated by an experimental study of Tsuchiya 
and Hasegawa, who investigated the binding of tRNA 
molecules that were obtained by an in vitro transcription 
approach, to overexpressed ApTrpRS. They were able to 
demonstrate that mutating the cytosine residues in the tryp- 
tophan anticodon, 5'-CCA-3', severely impedes aminoacy- 
lation, whereas adenine mutants show wild-type activity (8). 
The potential hydrogen bonding network formed by C337, 
D309, and the crystal water was not conserved during struc- 
tural optimization and seems not to be important for clus- 
ter function. Rather, the water molecule changed position 
and formed a hydrogen bond with the peptide carbonyl-0 
of C337 (see Figure 9, left panel). 



Nucleic Acids Research, 2014, Vol. 42, No. 9 5433 



Apparently, recognition of a 5' base could be accom- 
plished by different choices of amino acid side chains 
around an Fe-S cluster. As can be seen from the alignment 
in Figure 8, not all bacterial TrpRS analyzed here feature 
an arginine residue at the sequence position correspond- 
ing to R224 in 7mTrpRS. In the enzymes from Chlorobium 
ferrooxidans and Aquifex aeolicus, arginine is replaced by 
lysine. Due to comparable side chain lengths of arginine 
and lysine, it seems very likely that lysine can fulfill the role 
of R224 in Tm TrpRS by interacting with the NFb group 
of cytosine. In the archaeal candidates, this sequence posi- 
tion does not show any signs of conservation. Instead, Q293 
serves the function of recognizing the 5' base in Aeropyrum 
pernix as discussed above. Interestingly, Q293 is replaced by 
histidine in Methanopyrus kandleri and Methanocaldococ- 
cus jannashii. 

Secondary adaptation or remnant of early life? 

Iron-sulfur clusters can be considered an early invention 
in the emergence of life. Iron-sulfur proteins are gener- 
ally highly conserved and appear at crucial positions in 
the metabolism of cells. The iron-sulfur world hypothesis 
even claims that the evolution of a metabolism preceded 
the invention of a genetic code (18). On the other hand, 
RNA molecules have always been considered attractive can- 
didates for early life because they can at the same time pos- 
sess enzymatic activity, i.e. as ribozymes, and carry genetic 
information. An iron-sulfur enzyme like TmTrpRS that is 
involved in deciphering nucleic-acid encoded information 
and protein biosynthesis, therefore could echo in a present- 
day organism a possible convergence of iron-sulfur and ri- 
bonucleic acid-based origins of life. As tempting as such an 
explanation might be, it must be noted that the occurrence 
of iron-sulfur clusters in TmTrpRS or ^/?TrpRS could be a 
mere secondary adaptation to extreme environmental con- 
ditions. 

It is difficult to answer this question by phylogeny based 
on a single gene, because the high degree of lateral gene 
transfer inside and between the kingdoms bacteria and ar- 
chaea (46-48) prevents us from clearly determining whether 
a TrpRS gene with an iron-sulfur cluster binding domain 
represents the chronologically older variant. The align- 
ment shown here does not provide an exhaustive overview 
of bacterial and archaeal TrpRS gene distribution. How- 
ever, it does reveal a number of intriguing details. Inter- 
estingly, the distribution of putative Fe-S clusters is not 
linked to certain bacterial classes. For example, in the 
class gamma-proteobacteria, the prototypical enterobac- 
terium E. coli carries a TrpRS lacking the characteristic 
cysteine -rich consensus sequence indicative of a [Fe4S4] 
cluster, 5'-C-X 6 -C-X 2 -C-3', whereas in the order Legionel- 
lales and Pseudomonadales, TrpRS that most probably con- 
tain the cluster occur. Examples are the medically relevant 
species Legionella pneumophila and Pseudomonas stutzeri 
that clearly do not belong to the group of thermophilic bac- 
teria (see Figure 7). In order to prevent false positive results, 
comparative modeling was applied to confirm that the se- 
quences are in accordance with the protein fold of an Fe-S 
cluster-containing TAB domain (see Supplementary Figure 
S2 and Table S2). Given the subtle structural interplay in 




Figure 10. A missing [Fe4S4] cluster in TrpRS from Pyrococcus horikoshif! 
Left panel: Putative TAB domain of Ph TrpRS with the four free cysteine 
residues highlighted (PDB entry 3JXE (51)). The black arrows assign the 
strand break in the crystal structure. Right panel: [Fe4S4] cluster and co- 
ordinating cysteine side chains (black) from the TmTrpRS structure 2G36 
positioned in the TAB domain of Ph TrpRS by superimposing the crystal 
structures of iViTrpRS and TmTrpRS. 



the recognition of tRNA by TrpRS, an Fe-S cluster in the 
TAB domain seems plausible under extreme environmental 
conditions due to the structural rigidity the cubane scaffold 
provides. However, the occurrence of Fe-S cluster-free Tr- 
pRS in thermophilic bacteria like Thermus thermophilus or 
archaea like Sulfolobus solfataricus (see Figure 7) shows that 
this motif is not strictly required for translational fidelity. 

A curiosity in this respect is the parasitic or symbiotic re- 
lationship between the archaebacteriae Nanoarchaeum eq- 
uitans and Ignicoccus hospitalis (49). The extremophile or- 
ganism N. equitans lives near hydrothermal vents. It requires 
the presence of a host organism because its reduced genome 
lacks several enzymes required to sustain essential path- 
ways. Interestingly, a Trp-tRNA-synthetase is present (like 
all gene products required for replication, transcription and 
translation (49,50)) but does not contain a single cysteine 
residue and is therefore unlikely to contain an Fe-S clus- 
ter. In contrast, potential host organisms existing in the 
same habitat, like Ignicoccus hospitalis, carry an Fe-S clus- 
ter consensus sequence (see Figure 7). The strict colocaliza- 
tion of both organisms therefore further calls into question 
the claim that Fe-S clusters represent a mere adaptation for 
Trp-tRNA aminoacylation under extreme conditions. 

Clearly, sequence homology alone has to be considered 
carefully. This becomes especially apparent in the example 
of Pyrococcus horikoshii whose TrpRS protein (Ph TrpRS) 
features an Fe-S cluster consensus sequence (see Figure 
7). Surprisingly, a crystal structure of the synthetase that 
has been solved (51) does not show an iron-sulfur clus- 
ter. However, closer inspection of the TAB domain region 
and comparison with the TmTrpRS structure shows that in 
P/zTrpRS, four cysteine residues are located in the same way 
as in TniTrpRS and should therefore be suited to accommo- 
date and coordinate an Fe-S cubane (see Figure 10). The 
P/zTrpRS crystal structure features a strand break and the 
quality of the electron density in the TAB domain is rather 
low (not shown); this might be due to an increased flexibility 
induced by the free four cysteine residues. An explanation 
for the missing [Fe4S4] cluster could be that the overexpres- 
sion system (E. coli strain BL21) that was used for isolating 
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WiTrpRS does not provide the proper P. horikoshii-speciHc 
Fe-S cluster maturation and insertion machinery and that 
the native protein indeed carries a cubane cluster. Accord- 
ing to the phylogenetic and structural analysis provided by 
Dong et ah, the origin of TrpRS lies in the kingdom ar- 
chaea and was acquired subsequently by bacterial ancestors 
of extant organisms (51). A convergence of iron-sulfur and 
RNA-based life exemplified by Fe-S cluster-containing Tr- 
pRS proteins seems therefore plausible. 

Interestingly, an Fe-S cluster consensus motif is also 
present in the TrpRS of the bacterium Desulforudis audaxvi- 
ator that was discovered in 2008 in a South African gold 
mine at 2. 8 -km depth and lives in complete autarky in this 
isolated ecosystem (54). The high sequence homology with 
ZmTrpRS (52% identical residues) allows for reliable com- 
parative modeling and shows that an Fe-S cluster should 
be present even in Z>aTrpRS (see Supplementary Figure S2 
and Table S2). 

CONCLUSION 

Attributing a role to a well-known structural unit that 
appears in an unprecedented context can be difficult. As 
shown here, computational chemistry can complement bio- 
chemical experiments in the interpretation of experimen- 
tal findings. Theoretical modeling of iron-sulfur proteins 
is challenging because transition metal complexes require a 
quantum mechanical description that results in significant 
computational effort, limiting thereby the model sizes that 
can be treated. Nevertheless, the data provided here clearly 
support the idea that iron-sulfur clusters in prokaryotic and 
archaeal TrpRS proteins are able to provide affinity and 
specificity for deciphering the 5' base in the anticodon loop 
of their cognate tRNA Trp molecules by providing a stable 
structural pattern that assures affinity to the base and at the 
same time specificity by producing an energetic and struc- 
tural penalty against the binding of non-cognate base moi- 
eties. The models considered here suggest that it is mostly 
the thiolate-S atom of a cluster-coordinating cysteine that 
directly interacts with the base. Clearly, an [Fe4S/i] cluster 
is perfectly suited to provide this functionality in a struc- 
turally restrained setting that is robust against structural 
disturbances and changes in the redox conditions of the cell. 
In comparison to the typical role Fe-S clusters play in elec- 
tron transport, this function seems not to be of major im- 
portance for the specific recognition of anticodon bases, as 
no significant differences could be observed for the two re- 
dox states analyzed. 

The occurrence of [Fe4S4] cluster-free TrpRS representa- 
tives in thermophilic organisms demonstrates that the prob- 
lem of anticodon recognition can be solved even under 
extreme conditions without additional cofactors, and the 
question remains why the energetically demanding process 
of cluster formation is maintained in extant organisms. It 
might very well be that iron-sulfur clusters in the context 
of tRNA decoding are remnants of an early convergence of 
RNA and iron-sulfur-based life forms in a setting that pro- 
vided a plethora of metal sulfides, as do hydrothermal vents. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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